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An  Extension  of  the  Two-Parameter  Logistic  Model 
to  the  Multidimensional  Latent  Space 


Item  response  theory  (IRT)  has  proven  to  be  a  very  powerful  and  useful 
measurement  tool.  However,  most  of  the  IRT  models  that  have  been  proposed, 
and  all  of  the  models  commonly  used,  require  the  assumption  of  unidimensionality, 
which  prevents  their  application  to  a  wide  range  of  tests.  The  few  models 
that  have  been  proposed  for  use  with  multidimensional  data  have  not  been 
developed  to  the  point  that  they  can  be  applied  in  actual  testing  situations. 

The  purpose  of  this  report  is  to  present  a  model  for  use  with  multidimensional 
data  and  to  discuss  some  of  its  characteristics.  This  discussion  will 
include  information  on  the  interpretation  of  the  model  parameters,  the 
sufficient  statistics  for  the  model  parameters,  and  the  information  function 
for  the  model.  In  addition,  a  procedure  for  estimating  the  parameters  of 
the  model  will  be  discussed. 


The  Model  and  Its  Characteristics 


The  Model 


The  model  proposed  in  this  report  is  a  multidimensional  extension  of 
the  two-parameter  logistic  model.  The  two-parameter  logistic  (2PL)  model, 
proposed  by  Birnbaum  (1968),  is  given  by 


exp (Da  (9.  -  b  )) 

p  (a  \  —  _ _ i. - J - ! - 

1  j  l  +  exp(Dai(0j  -  b1>) 


(1) 


where  a.  is  the  discrimination  parameter  for  item  i,  b.  is  the  difficulty 
parameter  for  item  i,  0^  is  the  ability  parameter  for  examinee  j,  P^(0^)  is 


the  probability  of  a  correct  response  to  item  i  by  examinee  j,  and  D  =  1.7. 
The  multidimensional  extension  of  the  2PL  model  (M2PL),  as  presented  by 
McKinley  and  Reckase  (1982),  is  given  by 


W 


exp (d .  +  a  .  0  ) 

_ 1 _ -I-} _ 

1  +  exp(d.  +  a  .  0  .  )  ’ 
i  -  l  —  j 


(2) 


where  is  a  row  vector  of  discrimination  parameters  for  item  i,  0.  is  a 
column  vector  of  ability  parameters  for  examinee  j,  P^(0^)  is  the  probability 
of  a  correct  response  to  item  i  by  examinee  j,  and  d^  is  given  by 


m 


-l  a 
k=l 


ikbik  ’ 


(3) 


where  a.^  is  the  discrimination  parameter  for  item  i  on  dimension  k,  b.^  is 

the  difficulty  parameter  for  item  i  on  dimension  k,  and  m  is  the  number  of 
dimensions  being  modeled.  The  d^  term,  then,  is  related  to  item  difficulty, 


1 


2 


but  is  not  a  difficulty  parameter  in  the  same  sense  as  the  parameter  is 
in  the  unidimensional  model. 


Interpretation  of  the  Model  Parameters 


The  interpretation  of  the  parameters  of  unidimensional  IRT  models  is 
closely  tied  to  the  item  characteristic  curve  (the  regression  of  item  score 
on  ability) .  The  item  difficulty  parameter  is  defined  as  the  point  on  the 
ability  scale  where  the  point  of  inflection  of  the  item  characteristic 
curve  (ICC)  occurs.  This  is  equivalent  to  saying  the  item  difficulty  value 
is  the  point  on  the  ability  scale  where  the  second  derivative  of  the  ICC 
function  is  equal  to  zero.  For  the  2PL  model,  the  second  derivative  is 
given  by 


=  D2ai2PQ(l  -  2P) 


where  P  is  the  probability  of  a  correct  response  to  item  i  given  ability  j, 
Q  =  1-P,  and  a.  and  D  are  as  previously  defined.  Setting  the  right  hand 

side  of  (4)  equal  to  zero  yields  a  solution  of  ?  =  Q  =  0.5.  Of  course, 

P  =  1.0  and  P  =  0.0  are  also  solutions,  but  these  represent  degenerate 
cases  where  0  =  +<»  and  0  =  respectively.  Thus,  the  point  of  inflection 

occurs  at  P  =  0.5,  which  occurs  where  b.  =0..  The  difficulty  of  an  item 
.  ^  J 

for  the  2PL  model,  then,  is  the  point  on  the  ability  scale  which  yields  a 
probability  of  a  correct  response  equal  to  0.5.  Figure  1  shows  a  typical 
ICC  for  the  2PL  model.  The  dotted  line  shows  the  relationship  among  the 
item  difficulty  value,  ability,  and  the  probability  of  a  correct  response. 


Figure  1 

A  Typical  ICC  for  the  2PL  Model 
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The  item  discrimination  parameter  is  related  to  the  slope  of  the  ICC 
at  the  point  of  inflection.  The  slope  of  the  ICC  at  the  point  of  inflection 
is  found  by  taking  the  first  derivative  of  the  ICC  and  evaluating  it  at  the 
point  of  inflection.  For  the  2PL  model  the  first  derivative  is  given  by 


DatPQ  , 


(5) 


where  all  the  terms  are  as  previously  defined.  It  was  previously  found 
that  the  point  of  inflection  for  the  2PL  model  occurs  where  P  =  0.5. 
Substituting  0.5  into  (5)  yields  a  slope  at  the  point  of  inflection  of 
Da ,/4. 

l 

Difficulty  and  discrimination  are  defined  somewhat  differently  for 
multidimensional  models.  To  begin  with,  the  response  function  (the  model) 
defines  a  multidimensional  item  response  surface  (IRS)  rather  than  a  curve. 
This  surface  may  have  many  points  of  inflection,  and  the  points  of  inflection 
may  vary  depending  on  the  direction  relative  to  the  8-axes.  Because  of  this, 
the  item  parameters  for  the  M2PL  model  are  defined  in  terms  of  directional 
derivatives  (Kaplan,  1952). 


For  multidimensional  models,  difficulty  is  defined  as  the  locus  of 
points  of  inflection  of  the  IRS  for  a  particular  direction.  This  is  found 
by  taking  the  second  directional  derivative  of  the  response  function, 
setting  it  .equal  to  zero,  and  solving  for  the  0-vector.  The  second  directional 
derivative  for  the  M2PL  model  is  given  by 


where  $  represents  the  vector  of  angles  with  respect  to  each  of  the  m  axes. 
Solving  the  derivatives  in  (6)  and  simplifying  yields 

V  2P  =  PQ(1  -  2P)  (aicositi  +  ancos^i  +  .  .  .  +  a  cos4>  ) 

$  mm 


(7) 


«* 


Setting  (7)  equal  to  zero  and  solving  yields  three  solutions — P  =  0.0, 

P  =  0.5,  and  P  =  1.0.  The  solutions  0.0  and  1.0  represent  degenerate  cases 
where  0  =  ±®.  P  =  0.5  occurs  when  the  exponent  of  the  M2PL  model  is  zero. 
That  is  P  =  0.5  when 


d  +  aj0j  +  3282 


+  a  0 
m  m 


In  the  two-dimensional  case  this  is  the  equation  for  a  line. 

For  the  M2PL  model,  as  can  be  seen  from  the  above  derivations,  the 
direction,  $,  falls  out  of  the  equations.  Item  difficulty  for  the  M2PL  model 
is  the  same  for  all  directions  of  travel.  This  is  not  necessarily  the  case 
for  all  multidimensional  models. 

Figure  2  shows  a  typical  response  surface  for  the  M2PL  model  in  the 
two-dimensional  case.  The  dotted  line  indicates  the  line  of  difficulty. 

In  the  m-dimensional  case  (8)  is  the  equation  for  a  hyperplane. 


Figure  2 

A  Typical  Response  Surface  for  the  M2PL  Model 


For  multidimensional  models,  item  discrimination  is  a  function  of  the 
slope  of  the  IRS  at  the  locus  of  points  of  inflection  in  a  particular 
direction.  This  is  obtained  by  taking  the  first  directional  derivative  of 
the  response  function  and  evaluating  it  at  the  locus  of  points  of  inflection. 
For  the  M2PL  model  the  first  directional  derivative  is  given  by 


6P  .  6P  . 

V  =  607  cos^  +  s?7  cos*2 


where  $  represents  the  vector  of  angles  of  the  direction  in  the  0-space 
with  respect  to  each  of  the  m  axes.  For  the  two-dimensional  case  (9)  is 
given  by 


V  P  =  a^PQ  cos  <}>  +  a2pQ  sin  . 


where  0  is  the  angle  with  the  0*  axis.  When  0=0°  (direction  parallel  to 
0i  axis)  the  slope  is  aiPQ,  and  when  0  =  90°  (parallel  to  02  axis)  the  slope 
is  a2PQ.  In  general,  when  the  direction  is  parallel  to  the  0^  axis,  the  slope 

is  a  PQ.  Since  P  =  Q  =  0.5  at  the  line  of  inflection,  the  slope  parallel  to 
m 

the  0  axis  at  those  points  is  a  /4.  In  the  unidimensional  case  0=0°,  and 
m  m 

the  slope  of  the  ICC  at  the  point  of  inflection  is  Da/4. 

Sufficient  Statistics 

Definition  Assume  that  there  exists  some  distribution  that  is  of 
known  form  except  for  some  unknown  parameter  0,  and  that  x  represents  a  set 
of  observations  from  that  distribution.  Also  assume  that  S(x)  is  some 
statistic  which  is  a  function  of  x.  If  S(x)  is  a  sufficient  statistic  for 
0,  then  it  must  be  possible  to  factor  the  probability  function  of  x,  P(x|0) 
into  the  form: 

P(x(0)  =  f  (S(x)|0lg(x)  •  (ID 

In  this  form  it  is  easy  to  see  that  g(x)  is  independent  of  0,  and  so  provides 
no  information  about  0.  Selection  of  0  to  maximize  the  probability  of  x  is 
tantamount  to  selecting  0  to  maximize  the  probabilty  of  S(x). 

In  item  response  theory  x  is  typically  a  response  string,  either  by 
one  examinee  to  a  set  of  items  or  by  a  set  of  examinees  to  a  single  item. 

In  this  case,  P(x|0)  is  the  likelihood  of  the  response  string.  For  the 
M2PL  model,  the  likelihood  of  an  examinee's  response  string  is  given  by 


p(x . 1 0 .)  =  ri  p(x  |§  ) 

J  J  i=i  J 


where  x„  is  the  response  to  item  i  by  examinee  J  *  Is  the  vector  of 

abilities  for  examinee  j,  x^  is  the  response  string  for  examinee  j,  and  n 

is  the  number  of  items.  The  likelihood  of  the  set  of  responses  to  an  item 
is  given  by: 
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p(si  K-v  ■  ;1PUiJid1'S1  > 


where  P(x_|d^,  a^)  is  the  probability  of  response  x_  for  item  i,  d  and 

a^  are  the  item  parameters  for  item  i,  x^  is  the  vector  of  responses  to 

item  i,  and  N  is  the  number  of  examinees.  In  order  for  any  statistic  to  be 
a  sufficient  statistic  for  a  parameter  of  the  M2PL  model,  it  must  be  possible 
to  factor  the  appropriate  likelihood  function  into  the  form  given  by  (11). 

Sufficient  Statistic  for  the  Ability  Parameter  For  the  M2PL  model 
(12)  can  be  factored  into  the  form: 

n  n  n 

P(x.  li.  )  "  HQ  (6  )exp(£.  E  a  x  )exp(  Z  d  x  ) .  Q4) 

J  J  i=i  1  J  J  i=i  1  i=l  1  J 

From  (14)  it  can  be  seen  that 


4<ij> '  i-r1^  <15) 

is  a  vector  of  sufficient  statistics  for  0 ^ .  (For  a  discussion  of  the 

derivation  of  the  sufficient  statistic  for  ability  in  the  unidimensional 
case,  see  Lord  and  Novick,  1968,  chapter  18). 

Sufficient  Statistics  for  the  Item  Parameters  For  the  item  parameters 
of  the  M2PL  model,  (13)  can  be  factored  into  the  form: 

N  N  N 

-i  ldi‘-i  >  =  )exp(^i  l  I  j  xij)exp(di  E  Xij)  .  (16) 

^  J  ""  1 

From  (16)  it  can  be  seen  that 


sd(5i)  ‘  .E*ij 

J  =  1 


is  a  sufficient  statistic  for  the  d-parameter,  and 


s  (x  )  =  E  0  .  x 
-a  -i  j  =  1-J  U 

is  a  vector  of  sufficient  statistics  for  the  a-parameter. 


Information  Function 

Definition  In  item  response  theory  the  precision  of  estimates  based 
on  a  given  scoring  formula  are  generally  described  in  terms  of  the  information 
function  of  the  scoring  formula.  The  information  function  of  a  particular 


•*.  *<  4  -  •’  .*  * 
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) 

} 

scoring  formula,  as  given  by  Lord  and  Novick  (1968),  is  given  by 


i  |e,s(x)] - -! - —  e  fs(x)|e' 

L  4  o1  [S(x),e3|  1  59  L 

where  s(x)  represents  a  given  scoring  formula  for  the  model  of  interest, 
a2[s(X),0]  is  the  variance  of  the  scoring  formula,  and  the  derivative 
3E[s(X) jej/36  specifies  how  the  mean  of  the  scoring  formula  changes  as  0 
changes . 

If  s(x)  takes  the  form 


n 

s(x)  =  £  w  x.  , 

i=l 

where  w.  is  a  positive  number,  then  the  expected  value  E[s(X) |0] 
by  1 


(20) 
is  given 


e  [s(x)|e] 


I  w  P  (9)  , 
i=l 


(21) 


and  the  variance  of  the  scoring  formula  is  given  by 


a 


2 


n 

=  E  V  2P.(6)Q  (0). 
i=l 


(22) 


(For  a  discussion  of  these  derivations,  see  Lord  and  Novick,  1968). 
Substituting  (21)  and  (22)  into  (19)  yields 


r  n 

r n  i 

n 

i  L e,s(x) J  = 

l  V  2p  (9)Q  (9) 

~  1 

E  w  P  '(0) 

i=  1  1 

i=l 

i 

where  (0)  =3Pi(0)/30.  For  a  single  item  (23)  takes  the  form 
I  [ 9,s(x)J  =  P1'(9)2/Pi(9)Qi(9)  , 


(23) 


(24) 


which  is  the  item  information  function.  If  (24)  is  written  in  terms  of  the 
response  x.,  rather  than  the  scoring  formula  s(x),  the  same  result  is 
obtained.  1That  is,  1(0, x.)  =  I[0,s(x.)J.  Lord  and  Novick  (1968)  have 
shown  that,  unless  s(x)  represents  the  locally  best  weights  at  0, 

I(0,s(x)J  <  I  1(0, x.).  That  is,  the  sum  of  the  item  information  functions, 
which  is  independent  of  the  the  scoring  formula,  represents  an  upper  bound 
on  each  and  all  information  functions  obtained  using  different  scoring 
formulas.  The  sum  of  the  item  information  functions  is  called  the  test 
information  function,  and  is  given  by 
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n  n 

1(6)  =  Z  1(6, x.)  »  I  P'(6)2/P  (8)Q  (6).  (25) 

i=l  i=l  1 

Information  Functions  for  the  M2PL  Model  For  the  unidimensional  2PL 
model,  given  by  (1),  the  item  information  function  is  given  by 

1(6, x±)  =  D2ai2Pi(6)Qi(6)  .  (26) 

Test  information  for  the  unidimensional  2PL  model  is  given  by 

n 

1(6)  =  Z  D2a. 2P  (0)Q. (0) •  (27) 

i-1  11  1 

As  was  the  case  for  discrimination,  information  for  the  M2PL  model 
varies  depending  on  the  direction  relative  to  the  0-axes.  Therefore, 
item  and  test  information  for  the  M2PL  model  are  defined  using  the  first 
directional  derivative  of  the  response  function,  which  is  given  by  (9). 
Item  information  for  the  M2PL  model  is  given  by 


1(0  ,x^)  ■  a*2  PQ  cos2$j  +  a22  PQ  cos2i^2  +  .  .  .  +  a^2  PQ  cos24>m  + 
2a.\B2  PQ  cos$j  cosi(i2  +  .  .  .  +  2ajam  PQ  cos<J>i  cos<f>m  + 

(28) 


2a,  ..a  PQ  cos<J> ,  ..cost 

(m-l)m  x  (m  -  1 )  m 


For  the  two  dimensional  case,  this  simplifies  to 

2 

I(£,x^)  =  PQ(a!Cos$  +  a2sin$) 


(29) 


Note  that  when  the  direction  of  travel  is  parallel  to  the  9-axis  (<|>  =  0°), 
item  information  is  given  by  ax2PQ.  When  only  02  is  of  interest  ($  =  90°), 
item  information  is  given  by  a22PQ.  If  the  two  dimensions  are  weighted  equally 
(4*  =  95°),  item  information  is  given  by  0.5(a!2PQ  +  2aj  a2PQ  +  a22PQ) •  Figures 
3,  4,  and  5  show  typical  item  information  surfaces  for  4>  =  0°,  45°,  and  90° 
respectively.  Note  that  these  are  not  the  same  surface  seen  from  different 
angles.  They  are  different  surfaces,  all  for  the  same  item,  obtained  by 
varying  the  direction  with  respect  to  the  6-axes.  As  can  be  seen,  they  are 
quite  different.  Test  information  for  the  M2PL  model  is  simply  the  sum  of 
(29)  over  all  of  the  items.  Figures  6,  7,  and  8  show  typical  test  information 
surfaces  for  4  =  0°,  45°,  and  90°,  respectively.  Again,  the  three  surfaces 
are  quite  different,  indicating  that  the  test  gives  different  amounts  of 
information  that  are  concentrated  at  different  places  in  the  0-space  when 
different  weighted  composites  of  ability  are  of  interest. 


Maximum  Likelihood  Estimation 


Maximum  likelihood  estimation  of  the  parameters  of  the  M2PL  model  is 
relatively  straightforward.  The  likelihood  of  a  response  matrix  for  the 
M2PL  model  (or  for  any  latent  trait  model)  is  given  by 


N 


l  =  n  n  P(x  ) 
i=l  j*l  13 


(30) 


where  all  the  terms  have  been  previously  defined.  For  an  examinee's  response 
string,  the  likelihood  is  given  by  (12),  and  the  likelihood  of  a  response 
string  for  an  item  is  given  by  (13).  The  first  derivative  of  ..fie  loge  of 
the  likelihood  given  in  (12)  is  given  by: 
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and  the  first  derivative  of  the  log  of  the  likelihood  given  in  (13)  is 


given  by 
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for  the  difficulty  parameter,  and 
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for  the  discrimination  parameter. 


The  estimation  of  ability  using  maximum  likelihood  techniques  simply 


involves  setting  (31)  equal  to  zero  and  solving  for  0^ .  Of  course,  since 


this  involves  solving  simultaneous  nonlinear  equations,  some  type  of 
iterative  procedure  is  generally  required.  The  estimation  of  item  parameters 
involves  setting  (321  and  (33)  equal  to  zero  and  solving  for  d.  and  a., 


An  Item  Information  Surface 


Figure  6 


A  Test  Information  Surface 


respectively.  Again,  the  solution  of  simultaneous  nonlinear  equations 
requires  an  iterative  procedure.  McKinley  and  Reckase  (1983)  describe  a 
procedure  for  the  simultaneous  estimation  of  the  item  and  person  parameters 
of  the  M2PL  model  using  a  Newton-Raphson  procedure  for  solving  the 
simultaneous  nonlinear  equations.  A  computer  program  is  available. 


Discussion 

Although  IRT  has  gained  popularity  over  the  last  few  years,  applications 
of  IRT  models  have  been  limited  to  tests  for  which  the  assumption  of 
unidimensionality  is  at  least  defensible.  There  have  been  a  few  IRT  models 
proposed  for  use  with  multidimensional  data  (see  McKinley  and  Reckase,  1982, 
for  a  summary),  but  there  have  been  few  successful  attempts  at  their 
application.  Use  of  these  models  has  been  limited  due  to  the  absence  of 
practical  algorithms  for  parameter  estimation,  and,  at  least  in  part,  because 
the  models  are  not  well  understood. 

McKinley  and  Reckase  (1982)  have  proposed  a  model,  the  M2PL  model,  for 
use  with  multidimensional  data,  and  they  have  developed  a  program  for  the 
estimation  of  the  parameters  of  the  model  (McKinley  and  Reckase,  1983). 

The  purpose  of  this  report  is  to  provide  information  necessary  for  the 
understanding  and  use  of  the  M2PL  model. 

Many  of  the  characteristics  of  the  M2PL  model  are  not  straightforward 
extensions  from  the  unidimensional  case.  Rather,  the  unidimensional  case 
is  a  special  case  of  the  multidimensional  model  in  which  much  of  the  richness 
and  complexity  of  the  model  is  not  evident.  Because  of  this,  some  of  the 
characteristics  of  the  model  described  in  this  report  may  be  somewhat 
difficult  to  grasp.  In  order  to  aid  in  the  understanding  of  these  character¬ 
istics,  they  will  now  be  discussed  in  some  depth.  An  attempt  will  be  made 
in  each  case  to  describe  how  the  information  provided  relates  to  real-world 
applications.  The  discussion  will  begin  with  the  interpretation  of  the 
model  parameters,  and  will  include  the  sufficient  statistics,  information 
functions,  and  parameter  estimation.  Before  beginning  the  discussion  of 
the  characteristics  of  the  M2PL  model,  however,  a  brief  discussion  of 
directional  derivatives  will  be  presented,  since  directional  derivatives 
are  so  important  to  the  understanding  of  multidimensional  IRT  models. 


Directional  Derivatives 


One  of  the  most  interesting  and  complex  aspects  of  the  multidimensional 
IRT  models  which  is  lost  when  the  unidimensional  case  is  discussed  is  the 
notion  of  directional  derivatives.  In  the  unidimensional  case  the  only 
direction  ever  discussed  is  parallel  to  the  0-axis  (<{>  =  0°),  in  which  case 
all  the  trigonometric  terms  so  evident  in  the  material  presented  above  are 
absent--they  always  equal  1.0  or  0.0  and  therefore  drop  out  of  the  equations. 

Directional  derivatives  are  necessary  in  the  multidimensional  case 
because  the  derivatives  of  the  response  function  vary  depending  on  the 
direction  taken  relative  to  the  0-axes.  The  first  derivative  of  a  function 
gives  the  slope  of  the  function  at  any  given  point.  The  second  derivative 
gives  the  rate  at  which  the  slope  is  changing  at  a  particular  point.  The 
point  of  maximum  slope  is  where  the  slope  stops  increasing  and  starts 
decreasing.  At  the  point  where  that  change  occurs,  the  change  in  slope 


crosses  from  being  positive  (increasing)  to  negative  (decreasing).  Thus, 
at  that  point  the  change  in  slope  is  neither  positive  nor  negative,  but 
rather  is  zero.  Since  the  second  derivative  gives  the  rate  of  change  of 
slope,  the  point  of  maximum  slope  is  where  the  second  derivative  is  zero. 

In  the  unidimensional  case,  this  has  a  straightforward  application  in 
determining  item  difficulty  and  discrimination,  as  illustrated  in  Figure  1. 
The  point  where  the  dotted  line  crosses  the  ICC  is  the  point  of  maximum  slope 
and  minimum  (zero)  change  in  slope. 

In  Figure  2,  it  can  easily  be  seen  that  there  is  no  one  point  on  the 
surface  where  the  slope  is  at  a  maximum.  Moreover,  for  any  one  point,  the 
slope  varies  depending  on  the  direction.  Consider  the  point  on  the  surface 
where  0!  =  0.0  and  02  =  -2.5.  This  point  is  indicated  on  the  surface  by  an 
x.  Moving  along  the  02  =  -2.5  line  parallel  to  the  0X  axis,  the  surface  is 
rising  fairly  rapidly  at  the  point  indicated.  However,  moving  along  the 
0j  =  0  line  parallel  to  the  02-axis,  the  surface  is  still  relatively  flat 
and  is  rising  slowly.  Clearly  the  slope  of  the  surface  is  different 
depending  on  the  direction  of  travel.  The  same  is  true  of  the  change  in 
slope.  Because  of  this,  when  taking  derivatives  of  a  multidimensional 
response  function,  it  is  necessary  to  consider  the  direction.  Directional 
derivatives  are  a  way  of  doing  this.  The  actual  interpretation  of  the 
derivatives  in  different  directions  will  be  addressed  in  the  next  section, 
since  it  is  closely  related  to  the  interpretation  of  the  model  parameters. 


Interpretation  of  the  Model 


A  straightforward  extension  of  item  difficulty  from  the  unidimensional 
to  the  multidimensional  case  would  seem  to  lead  to  the  conclusion  that 
difficulty  in  the  multidimensional  case  ought  to  be  a  vector  of  b-parameters , 
with  one  b  for  each  dimension.  In  Figure  1  the  b-parameter  is  the  point  on 
the  0-scale  below  the  point  of  inflection.  It  represents  the  point  on  the 
0-scale  where  the  item  best  discriminates  between  high  and  low  ability.  At 
the  point  represented  by  the  b-parameter,  a  very  small  change  in  ability 
corresponds  to  a  large  change  in  the  probability  of  a  correct  response. 
Nowhere  on  the  0-scale  does  an  equal  change  in  ability  result  in  as  large  a 
change  in  the  probability  of  a  correct  response.  Thus,  in  the  unidimensional 
case,  the  item  difficulty  parameter  indicates  the  point  on  the  ability  scale 
at  which  the  item  does  the  best  job  of  discriminating  between  different 
levels  of  ability. 


On  the  surface,  it  seems  logical  to  conclude  that  if  there  are  two 
dimensions,  there  should  be  two  b-parameters.  One  b-parameter  should 
indicate  the  point  of  maximum  discrimination  on  one  dimension,  while  the 
other  b-parameters  indicate  the  point  of  maximum  discrimination  on  the 
other  dimension.  Figure  2,  however,  clearly  illustrates  that  this  is 
inadequate . 


As  can  be  seen  in  Figure  2,  the  two  ability  dimensions  do  not  act 
independently.  It  is  the  combination  of  ability  on  the  two  dimensions 
which  determines  the  probability  of  a  correct  response.  An  examinee  with 
02  =  2.0  clearly  has  a  higher  ability  on  that  dimension  than  an  examinee 
with  02  =  -2.0.  However,  if  the  second  examinee  has  0]  =  3.0,  while  the 
first  examinee  has  ©!  =  -3.0,  the  second  examinee  has  a  much  higher 
probability  of  a  correct  response  to  the  item  described  by  the  IRS.  Clearly, 
then,  considering  the  two  dimensions  separately  does  not  contribute  greatly 
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to  discriminating  between  examinees  who  have  different  probabilities  of  a 
correct  response.  This  is  reflected  in  the  fact  that  the  item  difficulty 
for  Figure  2  is  a  line  which  is  not  parallel  to  either  axis. 

This  has  important  implications  for  test  construction  and  analysis 
using  the  M2PL  model.  It  is  common  practice,  for  instance,  to  order  items 
on  a  test  by  difficulty,  or  to  construct  a  test  having  a  specified 
distribution  of  item  difficulty.  In  the  unidimensional  case  this  is  done 
using  the  b-parameter.  Clearly  in  the  multidimensional  case  it  is  more 
complicated.  An  item  having  a  smaller  d-parameter  than  a  second  item  is  only 
uniformly  more  difficult  than  the  second  item  if  their  difficulty  functions 
are  parallel.  If  the  difficulty  functions  intersect,  then  item  1  is  more 
difficult  than  item  2  in  some  regions  of  the  8-plane,  while  item  2  is  more 
difficult  in  other  regions. 

This  would  seem  to  indicate  that  it  is  only  reasonable,  in  the  multi¬ 
dimensional  case,  to  talk  about  ordering  items  on  difficulty  or  obtaining  a 
specified  distribution  of  difficulty  if  all  the  items  to  be  considered  have 
parallel  lines  of  difficulty.  Of  course,  in  the  m-dimensional  case  the 
items  would  all  have  to  have  parallel  (m- 1 )-dimensional  hyperplanes. 

In  order  to  determine  whether  two  items  have  parallel  lines  of  difficulty 
in  the  two-dimensional  case,  first  determine  the  form  of  the  difficulty  line. 
The  equation  for  the  line  of  difficulty  is  given  by  (8).  The  two  lines  are 
parallel  only  if  the  slopes  of  the  lines  are  equal.  Putting  (8)  into  a 
slope-intercept  form  yields 
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where  a^  is  the  item  discrimination  parameter  for  item  i  for  dimension  1, 

a^  is  for  dimension  2,  0^  is  the  ability  parameter  for  examinee  j  for 

dimension  1,  and  0^  is  the  ability  parameter  for  dimension  2.  If  item  2  is 

denoted  by  a  prime  ('),  then  the  lines  of  difficulty  for  items  1  and  2  are 
parallel  only  if 
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If  all  items  meet  the  condition  set  out  in  (35),  then  they  can  be  ordered 
by  difficulty,  by  simply  ordering  them  by  their  d-parameters . 

The  ordering  of  items  on  difficulty  implies  that  there  is  some  underlying 
variable  being  measured  that  has  some  correspondence  to  the  criterion  used 
for  the  ordering.  In  this  case  there  is  some  composite  of  the  0s  which 
corresponds  to  the  difficulty  continuum  formed  by  the  items  having  parallel 
lines  of  difficulty.  The  composite  is  determined  by  the  orientation  of  the 
lines  of  difficulty. 

The  extension  of  item  discrimination  to  the  multidimensional  case  is 
even  more  complex  than  the  extension  of  item  difficulty.  Unlike  difficulty, 
the  concept  of  item  discrimination  in  the  multidimensional  case  includes  a 
consideration  of  direction--the  angles  indicating  the  direction  do  not  fall 
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out  of  the  equations.  Although  the  slope  of  the  IRS  shown  in  Figure  2  is 
constant  all  along  the  the  line  of  difficulty  for  a  given  direction,  it 
varies  with  different  directions. 

The  need  to  consider  direction  has  important  implications  for  both 
test  construction  and  test  analysis.  It  is  not  enough  in  test  construction, 
for  instance,  to  merely  select  the  item  with  the  highest  discrimination 
from  among  the  available  items.  One  item  is  uniformly  more  discriminating 
than  a  second  item  only  if  the  slope  of  its  IRS  at  the  points  of  inflection 
is  greater  than  the  slope  of  the  IRS  for  the  second  item  for  all  directions. 

If  item  1  has  the  higher  a-value  on  one  dimension,  but  a  lower  a-value  on 
another  dimension,  there  may  be  directions  for  which  the  slope  of  the  IRS  at 
the  points  of  inflection  will  be  greater  for  item  2.  For  example,  consider 
the  case  where  item  1  has  discrimination  parameters  a  =  (1.0,  0.5)  and  item 
2  has  discrimination  parameters  a  =  (0.5,  1.0).  When  <J>  in  (10)  is  30  degrees, 
the  slope  of  the  IRS  at  the  points  of  inflection  is  0.279  for  item  1  and  0.233 
for  item  2.  When  <j>  is  60  degrees,  the  slope  for  item  1  is  0.233,  while  the 
slope  for  item  2  is  0.279.  At  <))  =  45  degrees,  both  items  have  a  slope  of 
0.265  at  the  points  of  inflection. 

It  seems  to  follow  from  the  above  discussion  and  example  that,  in 
interpreting  item  discrimination  in  the  multidimensional  case,  the  particular 
composite  of  abilities  of  interest  must  be  considered.  The  composite  might 
be  specified  a  priori,  as  in  test  construction,  or  discovered  by  post 
administration  analyses. 

Sufficient  Statistics 

The  notion  of  a  sufficient  statistic  is  not  a  simple  one  to  grasp. 
Essentially,  a  statistic  t  is  a  sufficient  statistic  for  the  parameter  0  if 
it  contains  all  the  information  in  the  sample  data  about  0.  For  example, 
the  number-correct  score  for  an  item  provides  all  the  information  in  the 
response  data  about  the  d-parameter.  For  the  a-parameter  for  a  particular 
dimension,  a  sufficient  statistic  is  provided  by  a  weighted  sum  of  the  item 
responses.  The  response  of  each  examinee  to  the  item  of  interest  is  weighted 
by  the  examinee's  ability  on  the  dimension  of  interest.  Thus,  a  correct 
response  to  an  item  by  an  examinee  of  high  ability  (0  >  0.0)  adds  to  the 
value  of  the  statistic,  while  a  correct  response  by  an  examinee  of  low 
ability  (0  <  0.0)  decreases  the  value. 

For  ability,  a  sufficient  statistic  is  provided  by  a  weighted  sum  of 
an  examinee's  responses  to  all  the  items.  The  weighting  factor  is  the 
discrimination  parameter  for  the  dimension  of  interest.  Thus,  a  correct 
response  to  a  highly  discriminating  item  adds  more  to  the  statistic  than  a 
correct  response  to  an  item  of  low  discriminating  power. 

Although  the  availability  of  sufficient  statistics  for  the  parameters 
of  the  M2PL  model  is  important  from  an  estimation  standpoint,  it  should  be 
pointed  out  that,  with  the  exception  of  the  d-parameter,  the  sufficient 
statistics  described  above  are  not  observable.  While  the  number-correct 
score  of  an  item  can  be  observed,  the  a-parameter  of  an  item,  and  thus  the 
sum  of  item  responses  weighted  by  discrimination  parameters,  is  not  observable. 
This  complicates  estimation  somewhat  by  requiring  that  provisional  estimates 
of  some  parameters  be  provided  during  the  estimation  of  the  remaining 
parameters.  Solutions,  then,  are  obtained  by  a  series  of  approximations  by 
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varying  from  one  step  to  the  next  which  parameters  are  estimated.  In  each 
step  the  provisional  estimates  for  the  parameters  not  being  estimated  are 
the  most  recent  estimates  of  those  parameters. 

Information  Function 

Item  information  in  the  multidimensional  case  is  like  item  discrimina¬ 
tion  in  that  the  information  yielded  by  an  item  for  a  particular  0  varies 
with  the  direction  of  travel.  This  has  important  implications  for  such 
applications  as  adaptive  testing,  in  which  items  may  be  selected  for 
administration  on  the  basis  of  item  information.  As  was  the  case  with  item 
discrimination,  the  interpretation  and  use  of  item  information  requires  the 
consideration  of  the  particular  composite  of  abilities  which  is  of  interest. 

Maximum  Likelihood  Estimation 

Estimation  of  the  parameters  of  the  M2PL  model  is  surprisingly  straight¬ 
forward.  Implementation  of  the  procedure  described  earlier  in  this  report 
is  not  particularly  difficult.  However,  it  is  rather  expensive. 

One  serious  limitation  of  the  procedure  described  is  that  there  is  no 
way  to  determine  in  advance  how  many  dimensions  should  be  included.  The 
procedure  is  too  expensive  to  allow  successive  runs  for  an  increasing 
number  of  dimensions  until  a  satisfactory  solution  is  obtained.  It  is 
clear  that,  if  the  M2PL  model  is  to  be  used,  more  work  is  needed  in  this 
area . 

More  work  also  needs  to  be  done  to  determine  sample  size  requirements 
for  estimation.  Some  guidelines  are  needed  for  determining  the  maximum 
number  of  items  and  subjects  required  for  good  estimation. 


Summary 


Item  response  theory  has  become  an  increasingly  popular  area  for 
research  and  application  in  recent  years.  Areas  where  item  response  theory 
has  been  applied  include  test  scoring  (Woodcock,  1974),  criterion-referenced 
measurement  (Hambleton,  Swaminathan,  Cook,  Eignor,  and  Gifford,  1978),  test 
equating  (Marco,  1977;  Rentz  and  Bashaw,  1977),  adaptive  testing  (McKinley 
and  Reckase,  1980),  and  mastery  testing  (Patience  and  Reckase,  1978). 

While  many  of  these  applications  have  been  successful,  one  unsolved  problem 
is  repeatedly  encountered--most  IRT  models  assume  unidimensionality.  As  a 
result,  applications  of  these  models  have  been  limited  to  areas  in  which 
the  tests  used  measure  predominantly  one  factor  (or  can  be  sorted  into 
subtests  which  measure  predominantly  one  factor).  When  the  assumption  of 
unidimensionality  is  not  met,  most  IRT  models  are  inappropriate. 


The  purpose  of  this  report  is  to  present  an  IRT  model  that  does  not 
require  unidimensional  tests.  With  such  a  model  the  great  power  of  item 
response  theory  as  a  measurement  tool  can  be  applied  for  many  of  the  pur¬ 
poses  for  which  unidimensional  models  are  employed,  without  the  limitation 
on  what  kinds  of  tests  are  involved  (i.e.,  the  dimensionality  of  the  tests). 
Of  course,  much  more  work  is  needed  before  the  model  can  be  employed  in 
reaL  testing  situations.  Procedures  for  the  use  of  the  model  for  different 


applications  must  be  worked  out  in  greater  detail,  and  limitations  on  the 
practicality  of  the  estimation  procedures  must  be  overcome.  The  information 
provided  in  this  report  provides  a  firm  foundation  for  future  work  in  these 
areas . 
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University  of  Notre  Dime 
Notre  Dame,  IN  AASSA 

1  Or.  Simi‘1  T.  Mayo 

Loyol  i  University  of  Cilcat’o 
S2U  North  Michigan  Avenu  • 

Chi  r.aijo ,  IL  bOSI  1 

1  Mr.  Roh'rt  McKinley 

American  Col  lei*'*  Testing  Programs 

P.O.  Box  I6B 

lowi  City.  IA  '>22<^^ 

I  Or.  Barbara  M'uns 

On  nan  R ‘sources  Research  Organization 
V).)  North  Wishlnqtoo 
Alexandria,  VA  2211A 

I  Dr.  Roh'rt  Mlslcvy 
711  III Inols  Street 
Geneva,  IL  AOllA 

I  Or.  Al len  Munro 

Behavioral  Technology  Laboratories 
IRAS  F.lena  Ave.  ,  Fourth  Floor 
Redondo  Reach,  C\  90277 

l  Or.  W.  Alan  NIC'*wandor 
University  of  Oklahoaii 
Department  of  Psychology 
Oklahoma  City,  OK  710S9 

I  Or.  M’lvln  R.  Novlck 

1S6  Lindquist  Center  for  M*asurm*nt 
University  of  Iowa 
low i  City,  IA  512A2 


Private  Sector 


Private  Sector 


1  Dr.  Jms  Olson 
WICAT,  Inc. 

1875  South  State  Strode 
Orem,  Ur  8405/ 

I  Wayne  M.  Patience 

Amebic  in  Council  on  Edncitlon 
GEO  Testing  Service,  S' 1 1 1 «■  2  ) 
Or  Dupont  Clrle,  N4 
Wishlnston,  1)0  20115 

1  Or.  .)  ini  •!  A.  P  »ul  son 

Portland  St. ite  University 
P.O.  Box  751 
Portlml.  OK  972  >7 

I  Or.  M  irk  0.  R-'ck  is  • 

ACT 

P.  O.  Box  168 
town  Glry.  TA  522 VI 

I  Or.  Thoms  Reynolds 

University  of  Tex.is-0  il  1  ns 
Market  ln«»  D'partm''nt 
P.  0.  Box  BBS 
Rich  irdson,  TX  75080 

1  Or.  L  iwrenee  Rudner 
ATI  Klin  \venu  • 

Takomi  Park.  MO  20012 

1  Or.  J.  Ryan 

Department  of  Education 
University  of  South  Cirollni 
Columbia,  SC  2920S 


1  Lowell  Scho’r 

Psychological  &  Qmnt  1 1  it  1  vo 
Found  it  1 ons 
College  of  Education 
University  of  lows 
l ow i  City.  t\  52242 

I  DR.  R01ER  T  ,1.  SEIDEL 

l NSTRUC  TtON AL  TECHNOLOGY  GROUP 
IIU'dRKO 

no  Washington  sr. 

ALEXANIIR  I  \  ,  VI  22  115 

1  Or .  K  i7.il)  Sh  Ir  »m  iso 
University  of  Toll  >ku 
O-pirtum'.  of  Elucitinnil  Psychology 
Kaw.iuehl  ,  S'*ndil  OB') 

JAP  VI 

1  Or.  Edwin  Shi rk ‘v 

0'p'irtm"nt  of  I’syhol o.*v 
University  of  Central  Florida 
Orlando.  FL  12816 

1  Dr.  Willi  am  Sims 

Center  for  Naval  Anil  vs  Is 
2)0  North  B"iure<*ard  Stre.-t 
Alexandria.  V\  '>2)11 

I  Dr.  If.  Wal  I  ice  Sin  ilko 
ProRr.iai  01  rector 

M  inonw-r  R"9;'  irrh  ml  Advisory  Services 
Smithsonian  Institution 
HOI  North  Pitt  Street 
Alexandria,  V\  22114 


l  PROF.  FIJMIKO  SAME  J  IMA 
DEPT.  OF  PSYCII  U.OGV 
UNIVERSITY  OF  TENNESSEE 
KNOXVILLE.  TN  17916 

l  Frank  L.  Schmidt 

0*pirtm*nt  of  Psychology 
SldR.  GG 

GeorR"  WishlnRton  University 
WishlnRton.  DC  20052 


I  Or.  Rob?rt  Sternh’rR 
D«pt.  of  Psycholoe.v 
Yale  University 
Box  1 1  A ,  Y  i  le  Station 
N>w  Haven.  CT  05520 

l  Or.  Peter  Stoloff 

Center  for  Niv.al  Anilysls 
200  North  H  ?.mret*ar<l  Stre-t 
Alexandria,  VA  22111 


1  Or.  Walter  Schneider  l  Or.  William  Stout 

Psychology  Department  University  of  Illinois 

601  E.  Dinlcl  D»oirt<mnt  of  Mathematics 

Champ alRn .  IL  61820  Urbina.  IL  61801 


Private  Sector 


Private  Sector 


1  Or.  Hirlhinn  Sunlnittiin 

Laboratory  of  Psychom  *trlc  an  ! 
Evaluitlon  Research 
School  of  Education 
University  of  Mass achus ?cts 
Amh  *rst .  M\  01011 

1  Dr.  Klkuml  Tatsuoka 

Computer  Rased  Education  R ‘Search  Lah 
252  Ene, Insert n<»  R>se  trch  l.abnr  itorv 
llrbiiai,  11.  51801 

l  Dr.  Miurlce  Titsuoki 
220  Edu~  it (on  Sid? 

1110  S.  Sixth  St. 

ChampilRn.  IL  51820 

I  Dr.  David  Thlss'*o 

l)’pirtm*nt  of  Psychology 
University  of  Kinsis 
Lavri'oc  >,  KS  58044 

1  Dr.  Robert  Tsutakiwi 
Department  of  Statistics 
University  of  Missouri 
Columbia,  MO  55201 

l  Dr.  J.  Uhl  m-'r 

Uhl  iner  Consultants 
4258  Son  ivft.a  Drive 
Enel  no  ,  CA  014 15 

1  Dr.  V.  R.  R.  Uppuluei 
Union  Carbide  Corporation 
Nuclear  Division 
P.  0.  Box  Y 
Oak  Rld?e,  TN  17810 

1  Dr.  Dnv  Id  Vile 

Assessment  Systems  Corporitlon 
2211  University  Avenu  • 

Suite  110 

St.  Paul .  MN  551 14 

1  Dr.  Hawtrd  W  ilnar 

Division  of  Psychological  Stulles 
Educational  Testln?  Strvlce 
Princeton.  Nl  08540 

l  Dr.  Mlcha-l  T.  Waller 

Oapirtment  of  Elu~ationil  Psychology 
University  of  Wisconsin — Milwaukee 
Milwaukee,  Wl  51201 


1  Dr.  Srlan  Witers 
HumRRO 

10.1  North  Wishln?ton. 

Alexandria,  V\  22114 

I  Dr.  David  !.  W*lss 
N450  Elliott  II  ill 
University  of  Minn  -sot  i 
75  E.  River  Ro i 1 
Minneapolis,  MN  554l>5 

1  Dr.  Rani  R.  Wl Irox 

University  of  Southern  Callfo-nl  i 
l)>n  irtn’nt  of  Psy-holoey 
I.OS  An«»e|es,  C\  010  1/ 

I  Wolffian?  Wlldi»riihe 
StrelCkra-f  teant 
Box  20  50  0 1 
D-5  10  )  Bonn  2 
WEST  GERMANY 

I  Dr.  Bruce  Wl 1 1  I  irns 

I)  *p  irtm ’nt  of  Educational  Psychology 
University  of  Illinois 
Urbina,  IL  518)1 

l  Dr.  W *ndy  Yen 
CTd/M-Gr.aw  II  1  l 
D»l  Monte  Research  Park 
Monterey,  CA  01940 


